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Dynamic shifts in transcription factor binding are central to the regulation of biological processes by allowing rapid 
changes in gene transcription. However, very few genome-wide studies have examined how transcription factor occu- 
pancy is coordinated temporally in vivo in higher animals. Here, we quantified the genome-wide binding patterns of two 
key hepatocyte transcription factors, CEBPA and CEBPB [also known as C/EBPaipha and C/EBPbeta), at multiple time 
points during the highly dynamic process of liver regeneration elicited by partial hepatectomy in mouse. Combining these 
profiles with RNA polymerase 11 binding data, we find three temporal classes of transcription factor binding to be 
associated with distinct sets of regulated genes involved in the acute phase response, metabolic/ homeostatic functions, or 
cell cycle progression. Moreover, we demonstrate a previously unrecognized early phase of homeostatic gene expression 
prior to S-phase entry. By analyzing the three classes of CEBP bound regions, we uncovered mutually exclusive sets of 
sequence motifs, suggesting temporal codes of CEBP recruitment by differential cobinding with other factors. These 
findings were validated by sequential ChlP experiments involving a panel of central transcription factors and /or by 
comparison to external ChlP-seq data. Our quantitative investigation not only provides in vivo evidence for the in- 
volvement of many new factors in liver regeneration but also points to similarities in the circuitries regulating self-renewal 
of differentiated cells. Taken together, our work emphasizes the power of global temporal analyses of transcription factor 
occupancy to elucidate mechanisms regulating dynamic biological processes in complex higher organisms. 



[Supplemental material is available for this article.] 

Most biological processes — such as embryonic development, dif- 
ferentiation, or cellular responses to external stimuli — are in es- 
sence dynamic, which requires the underlying transcriptional 
networks to be tightly temporally coordinated. Several large-scale 
temporal studies have focused on the comprehensive mapping of 
dynamic changes of gene expression but have not revealed how 
coordination is achieved on the transcriptional level (e.g., van 
Wageningen et al. 2010). To address this question globally, the 
binding of regulatory proteins has been examined using methods 
such as chromatin immunoprecipitation with microarrays (ChlP- 
chip) or sequencing (ChlP-seq) (e.g., Sandmann et al. 2006a; 
Johnson et al. 2007; Vogel et al. 2007). Still, many such studies 
have operated with just one or two conditions. The exceptions 
include temporal examinations in the model systems Drosophila 
and yeast (Sandmann et al. 2006b; Jakobsen et al. 2007; Ni et al. 
2009; Zinzen et al. 2009). So far, very few in vivo, multi-time point 
investigations of transcription factor (TF) binding have been un- 
dertaken in higher vertebrates. 
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Mammalian liver regeneration is a well-studied process, in 
which the large majority of mature hepatocytes rapidly and in 
a highly synchronized manner re-enter the cell cycle upon injury 
(Fausto et al. 2006; Michalopoulos 2007; Malato et al. 2011). Serial 
transplantation of liver tissue has demonstrated a very high 
"repopulating" capacity of hepatocytes (Overturf et al. 1997), 
which lends hope to using these cells in regenerative medicine. 
The ability of mature liver cells to proliferate is reminiscent of 
specific, differentiated cells of the immune system (naive T cells, B 
cells) that are kept in quiescence until exposed to specific stimuli 
(Glynne et al. 2000; Yusuf and Fruman 2003; Feng et al. 2008). 
However, it remains open whether similar programs control the 
proliferation of hepatocytes and immune cells. In the liver, a 
number of studies have mapped temporal changes in mRNA levels 
during regeneration (e.g., White et al. 2005). This, coupled with 
functional studies, has led to the identification of several TFs in- 
volved in the regenerative response (for review, see Kurinna and 
Barton 2011). Still, knowledge about how these factors are coor- 
dinated temporally throughout the regenerative process is limited. 

CEBPA (C/EBPalpha) and CEBPB (C/EBPbeta) are two key 
hepatocyte TFs known to have divergent roles in liver function and 
regeneration. The two factors belong to the same basic region 
leucine zipper-family (bZIP), and several studies have shown that 
they bind the same core DNA sequence, acting as either homo- or 
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heterodimers (Diehl and Yang 1994; Rana 
et al. 1995; Osada et al. 1996). While 
CEBPA is highly expressed in the quies- 
cent condition (before injury) and regu- 
lates many metabolic liver genes, CEBPB 
is up-regulated during liver regrowth and 
is required for a full regenerative response 
(Greenbaum et al. 1995; Wang et al. 1995). 
In many tissues, CEBPA is observed to be 
an anti-proliferative factor facilitating dif- 
ferentiation, while CEBPB has been found 
to be either pro- or anti-proliferative in 
different settings (Porse et al. 2001; Nerlov 
2007). In the skin, the two factors appear 
to act redundantly to limit epidermal stem 
cell activity (Lopez et al. 2009). 

In the current study, we have ex- 
amined dynamic TF binding during liver 
regeneration in the mouse by performing 
a time course of ChlP-seq experiments to 
map and quantify binding of CEBPA and 
CEBPB on a genome-wide scale at a high 
level of temporal resolution. We find that 
CEBPA and CEBPB generally occupy the 
same positions in the genome of hepato- 
cytes in vivo, but they do so with quanti- 
tatively divergent temporal patterns during 
regeneration. 

To further dissect the dynamic tran- 
scriptional network behind liver regen- 
eration, we interrogated the temporally 
defined groups of CEBP-bound elements 
for regulatory properties, both with re- 
spect to sequence composition and dif- 
ferential expression of associated genes. 

Results 

A time course of CEBPA and CEBPB 
in vivo Chip experiments reveals three 
distinct patterns of binding 

Liver regeneration in rodents has been 
studied in detail using partial hepatec- 
tomy, in which three of five lobes of the 
liver are resected. The hepatocytes in the 
remaining liver lobes undergo up to two 
cell cycles during the first week of liver 
regeneration, hereby reestablishing pre- 
surgical liver mass (for reviews, see Fausto 
et al. 2006; Michalopoulos 2007). To ex- 
amine the binding dynamics of CEBPA 
and CEBPB during liver regeneration, we 
harvested regenerating liver tissue at 
eight time points (0, 3, 8, 16, 24, 36, 48, 
and 168 h; Methods). These time points 
cover the quiescent state (Go), several 
stages of the first cell cycle growth phase 

(Gi), the Gi-S-phase transition at 36 h, a later time point at 48 h, as 
well as the terminal phase point of 168 h (Fig. lA; Matsuo et al. 
2003; Fausto et al. 2006). Livers from five mice for each time point 
were subjected to ChIP using specific CEBPA or CEBPB antibodies 
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Figure 1 . A ChlP-seq time course of mouse liver regeneration. (A) Overview of the experimental time 
course. Eighttime points covering the full week of the regenerative response; phases of the first cell cycle 
indicated. Previous studies show a prolonged decrease in the CEBPA to CEBPB ratio. (B) Experimental 
outline for ChIP experiments. Partial hepatectomy was carried out on five mice for each time point. The 
regenerating liver tissue was harvested after variable time intervals, fixed, and subjected to ChlP-seq. 
(C) Normalized coverage maps of four gene proximal regions with ChlP-seq peaks, marked by dashed 
vertical lines. Each row represents a time point for either CEBPA (top eight rows, dark purple) or CEBPB 
(bottom eight rows, dark pink). Gene loci are shown in black below. (D) CEBPA and CEBPB temporal 
binding profiles based on genomic coverage, generated from the peaks in the C (Alb-enh) Albumin 
proximal region; (Mdcl) mediator of DNA damage checkpoint 1; (Orml) orosomucoid 1; (Cpsl) 
carbamoyl-phosphate synthetase 1 . 



(Methods) (Fig. IB). We confirmed antibody specificity by doing 
Chip in mice deficient for Cebpa or Cebpb or by including epitope 
blocking peptides (Supplemental Fig. SI). To minimize experi- 
mental variation, we pooled DNA precipitated from each of the five 



Genome Research 593 



www.genome.org 



]akobsen et al. 



mice in equal amounts and sequenced the combined material. We 
mapped the sequences to the mouse genome (mm9) and deter- 
mined base pair coverage after internal normalization to the total 
mapped read count for each antibody (Supplemental Methods; Sup- 
plemental Fig. S2; Supplemental Table SI). Genomic regions bound by 
either CEBPA or CEBPB were identified with the Useq peak-finder al- 
gorithm using a mock immunoprecipitation (IP) control (Supple- 
mental Methods, Supplemental Table S2). We validated IP consistency 
of two series of three independent ChlPs at four different CEBP bound 
genomic locations (Supplemental Fig. S3). For further validation, we 
performed de novo motif searches on several data sets and a conser- 
vation score analysis centering on CEBP motifs, all of which con- 
firmed the quality of our ChlP-seq data (Supplemental Fig. S4). 

Next, CEBPA or CEBPB peaks for each time point were used to 
construct a sum list of all enriched regions. Four examples of 
enriched regions located just upstream of gene loci are shown in 
Figure 1, depicting genomic coverage for the CEBPA and CEBPB 
Chip time series (Fig. IC). 
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After applying a stringent filtering regimen, we identified 
a total of 11,314 high-confidence regions bound by CEBPA or 
CEBPB, many of which show highly dynamic occupancy during 
the course of regeneration (for full peak calling numbers and re- 
gion coverage, see Supplemental Tables S2, S3; for filtering, see 
Supplemental Methods). We assembled temporal binding profiles 
for all bound regions based on maximal genomic coverage. As il- 
lustrated in Figure 1 for the four regions mentioned above, distinct 
profile patterns can be observed (Fig. ID). 

The high temporal resolution of the obtained binding profiles 
allowed us to query for groups of putative c/s-regulatory regions 
with similar occupancy dynamics. By hierarchical clustering, we 
identified three prevalent clusters (Fig. 2A), containing roughly 
equal numbers of bound regions (3549, 2818, and 3034 for the A, 
B, and C clusters, respectively). Two clusters (A and B) were defined 
by strong CEBPB binding, with maxima either at the 3- and 36-h 
(A) or at the 3-h (B) time points, which is also evident in a summed 
profile of temporal coverage (Fig. 2B). As opposed to the A and B 
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Figure 2. Three distinct temporal binding patterns reflect the CEBPA and CEBPB protein level ratios. (A) Heatmap of hierarchical clustering on CEBPA 
and CEBPB binding profiles, based on genomic coverage in each region (row) and time point (column). White signifies low binding level (coverage 0); dark 
blue, high (cut-off at coverage 1 20). The most prominent clusters are marked A, B, and C. (B) Summarized coverage profiles of the three clusters. For each 
profile, summed time point levels are shown as normalized to the highest point (set to 1 ). (C) Western blot showing CEBPA and CEBPB levels through the 
first 36 h of liver regeneration. Isoforms indicated (CEBPA: p42 and p30; CEBPB: LAP*, LAP, and LIP). Loading control is histone H3. Experiment is 
representative of three independent repeats. (D) Representative quantification of protein levels based on signal intensities (using ImageJ). All isoforms 
included for protein level determination, normalized to a maximum value of 1 . 
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clusters, the C cluster was characterized by a high degree of tem- 
poral correspondence between the CEBPA and CEBPB binding 
levels, with maxima at the quiescent state 0-h time point and at 24 
h (Fig. 2A,B). The detected robust binding of CEBPA at the 16- and 
24-h time points was unexpected, as previous studies have re- 
ported decreased protein levels for this TF throughout the re- 
generative process (Greenbaum et al. 1995, 1998). To clarify this, 
we examined the protein levels for CEBPA and CEBPB until the first 
round of replication (time points, 0-36 h) by Western blotting 
(Fig. 2C). This highlighted a clear correlation between the CEBPA 
to CEBPB protein level ratios and the observed binding patterns. 
Specifically, we detected high levels of CEBPB at the 3- and 36-h time 
points, high CEBPA levels at the quiescent state (0 h), and a return 
of CEBPA predominance at 16- and 24-h time points (Fig. 2C,D). 

The three binding pattern clusters are associated with specific 
functional classes of genes 

The temporally distinct CEBP binding patterns of putative cis- 
regulatory regions belonging to the three clusters suggest divergent 
regulatory roles through the regenerative process. To address this 
possibility, we investigated whether each cluster was associated 
with specific sets of differentially expressed genes. To this end, we 
performed ChlP-seq experiments with liver tissue from the eight 
time points outlined above with an antibody specific to RNA 
polymerase II (POL2). In contrast to examining steady-state mRNA 
levels, this allowed us to directly measure the transcriptional ac- 
tivity at any given time point as POL2 binding to each gene body 
(Supplemental Methods; Sandoval et al. 2004). To pinpoint genes 
regulated by CEBPA and/or CEBPB, a single putative target gene 
was assigned to each bound region, based on proximity to the 
transcription start site (TSS) of neighboring genes (Supplemental 
Methods). Genes associated with regions belonging to one of the 
three binding profile clusters (A, B, or C cluster genes) were in- 
spected for differential expression (for a full target gene list, see 
Supplemental Table S4). 

For an initial overview, we counted differentially expressed 
genes from each cluster, defined as genes with a change in POL2 
gene body coverage above twofold, comparing each time point to 
0 h (Fig. 3A). This revealed that all clusters are associated with 
prominent gene expression changes at the 3-h time point. The C 
cluster was associated with a large group of down-regulated genes 
at 3 h and, furthermore, displayed up-regulation of many genes at 
24 h. In contrast, the A and B clusters mostly exhibit down-regulation 
at the 24-h time point and up-regulation early in the time course 
(3 and 8 h), as well as a resurge at 36 h (mainly the A cluster). 

Next, we focused the analysis on the 0-, 3-, 24-, and 36-h time 
points, as these displayed the most pronounced TF binding dif- 
ferences between the three clusters and therefore are likely to 
represent the most distinct regulatory states. Groups of genes, up- 
or down-regulated from one time point to the next (i.e., from 0-3 
h, 3-24 h, and 24-36 h), were subjected to gene ontology (GO) 
analysis with the online DAVID tool (Supplemental Methods) 
(Huang da et al. 2009). As summarized in Table 1, we observed ex- 
tensive differences in functional classes (GO terms. Biological Process) 
of differentially expressed genes associated with the three binding 
patterns (for full GO analysis lists, see Supplemental Table S5). 

At the 0- to 3-h transition, the A cluster displays almost equal 
numbers of genes up- and down-regulated (554 vs. 682) (Table 1). 
In contrast, the B and C clusters showed mainly decreased target 
gene activity, with 610 and 709 down-regulated genes (63% and 
70% of total regulated genes, respectively). The down-regulated 
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Figure 3. The A, B, and C cluster regions target distinct sets of genes. 
(A) Heatmap of gene expression changes through liver regeneration. "Up" 
and "down" bins count genes associated with a CEBP cluster (A, B, or C), 
exceeding a 2.0-fold change threshold (POL2 binding level change rela- 
tive to 0 h). Blue color intensity indicates number of genes going up; gray 
intensity, genes going down. (B) Representative examples of transcrip- 
tional activity (polymerase II [POL2] binding) of putative target genes. 
CEBPB binding illustrates gene association with either the C (Cpsi) or A 
(Mdcl) cluster exclusively (upper left and right panel, respectively) or as- 
sociation with peaks belonging to both B and C clusters (Adh7 and Ormi; 
bottom panels). Gene loci are shown in black below; binding levels for 
CEBPB and POL2, in dark pink and red, respectively. Black numbers in- 
dicate POL2 gene body coverage for time point comparison. (AdhT) Al- 
cohol dehydrogenase 7; for additional gene symbols, see Figure 1 . 



genes associated with both B and C regions are annotated with 
lipid metabolism or oxidation-reduction terms, while the activated 
genes are associated with acute phase or inflammatory response. 
This shift of gene activity is expected, as the liver shifts from 
a quiescent, homeostatic state to an acute stage as a response to 
injury. By inspection of shared B and C target gene loci, we found 
several examples of B and C peaks in close proximity to each other. 
These often bind CEBPs at mutually exclusive time points, as ex- 
emplified in Figure 3B (lower panels). This observation suggests that 
CEBP complexes, possibly via interaction with specific co-activators 
or repressors at B or C regions, could have opposing actions on 
transcription, leading to timed fine-tuning of gene activity. 

Prominent gene expression changes from 3 to 24 h include 
a significant down-regulation of A cluster genes with the term 
"transcriptional regulatory activity," a shift that is paralleled by 
a decrease of A cluster CEBPA and CEBPB occupancy. This may 
suggest that the expression of these regulatory factors is dependent 
on CEBP binding. Another finding was the up-regulation of genes 
targeted by C cluster regulatory regions (889 of 1209 differentially 
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expressed genes). Enriched C cluster GO terms were consistent 
with a resurge of homeostatic and metabolic gene activity before 
S-phase entry (Table 1). One notable example is Cpsl, carbamoyl- 
phosphate synthetase 1, which encodes a rate-limiting enzyme in 
the urea-cycle (Jones 1965) and displays a pronounced up-regulation 
at the 24-h time point (Fig. 3B, upper left panel). 

At the 36-h time point, the A cluster associated genes dis- 
played a strong bias toward increased transcriptional activity, with 
545 up-regulated against only 73 down-regulated, in contrast to 
the C group target genes with 1 77 up and 196 down. In the large set 
of up-regulated A cluster genes, we found a strong overrep- 
resentation of cell cycle GO terms (Table 1). An example of a cell 
cycle regulator gene (Mdcl) targeted by an A cluster enhancer is 
shown in Figure 3 (Fig. 3B, upper right panel). The B cluster shows 
association with relatively few regulated genes at this time point 
(272 up and 77 down). 

These data collectively show the three clusters to be associated 
with defined sets of genes with distinct biological functions and 
timing of expression. 

Motif analysis identifies specific and mutually exclusive sets 
of regulatory code 

The targeting of distinct sets of genes and the distinct binding 
patterns suggest the presence of divergent regulatory features in 
the regions constituting the three temporal clusters. Combinato- 
rial binding of multiple TFs has been proposed to explain dynamic 
changes in occupancy and gene regulation during development 
(e.g., Zinzen et al. 2009). 

To examine this possibility in our system, we carried out an 
analysis of the collective set of sequences of each binding cluster 
(A, B, or C) versus a background set. We counted instances of 
known TF cognate sequences to generate probability scores using 
the ASAP software (Marstrand et al. 2008). In order to attain com- 
prehensive coverage with minimal redundancy, the analysis was 
performed using a condensate of three publicly available databases 
of TF binding sequences or position weight matrixes (PWMs) (Sup- 
plemental Methods; Supplemental Table S6). Probability Z-scores 
for all 246 condensed PWMs with representation above a threshold 
were used for hierarchical clustering to examine the general differ- 
ences among the three clusters (Fig. 4A). Key examples are shown 
with full Z-score information (Fig. 4B). 

As expected, we found CEBP motifs to be among the most 
enriched for all of the binding profile clusters (Fig. 4A,B; Supple- 
mental Table S6). From the hierarchical clustering, it was evident 
that the majority of motifs found in the A set were shared with the 
B set, while a small group of motifs was found to have robust 
overrepresentation in all the three clusters (Fig. 4A). Prominent 
among these were motifs recognized by CREB and HNF4A (Fig. 
4A,B), two TFs known to have a wide set of target genes in hepa- 
tocytes and central roles in liver metabolism and development 
(Costa et al. 2003; Montminy et al. 2004; Bolotin et al. 2010). 

Notably many motifs displayed a clear mutually exclusive 
pattern of overrepresentation (Fig. 4A). Specifically, many motifs 
found in the A and B clusters of c/s-regulatory elements were un- 
derrepresented in the C cluster and vice versa. Binding sequences 
of the A and B regions belong to TFs related to stress response, 
proliferation, or cell cycle regulation, such as E2F, EGRl and MYC 
(c-myc), the hypoxia response factor HIFIA, the metal response 
factor MTFl, and Kruppel-like factors (KLFs) (Fig. 4B; Supplemental 
Table S6; Lichtlen and Schaffner 2001; Blais and Dynlacht 2004; 
Liao et al. 2004; Filers and Eisenman 2008; Majmundar et al. 2010; 



McConnell and Yang 2010; Zwang et al. 2011). The C regions, on 
the other hand, contain matches to PWMs of liver-specific 
TFs— such as FOXA2, ONECUTl (previously known as HNF6), and 
HNFIA (Costa et al. 2003; Guillaumond et al. 2010)— or factors 
found to be associated with a quiescent state (Go), e.g., MAFB and 
F0X03A (Greer and Brunet 2005; Sarrazin et al. 2009). Addition- 
ally, several different SOX motifs were identified as strongly over- 
represented within this cluster (Fig. 4B; Supplemental Table S6). 

Overall, these observations suggest significant differences in 
the regulatory mechanics of the A/B clusters versus the C cluster of 
putative c/s-regulatory elements. Moreover, the data point to sets 
of specific, dynamic TF binding partners of CEBPs, defining a 
temporal regulatory code. 

Multi-level support of temporal c/5-reguIatory code predictions 

To assess if the TFs predicted to interact with A or C cluster regions are 
present in the liver, we examined their expression levels (POL2 gene 
body reads) against all genes (Fig. 4C; Supplemental Table S7). This 
revealed that the majority are highly expressed, being represented 
by either direct matches or top protein family members (e.g., Klfl 0, 
-13, -15 or Soxl3, -15, -18). Strikingly, Cehpa, Cehph, andMafbare at 
the top of the list (rank positions of 107, 21, and 61, respectively). 

We further examined the indicated differences of the three 
clusters by interrogating their genomic position relative to the 
nearest TSS (Fig. 4D). The three clusters clearly diverge in this re- 
spect, as the C cluster regions are distinct from A (or B) regions by 
showing no proximity to TSSs. Several studies have shown differ- 
ences in preferred genomic positioning of TFs (e.g., Gerstein et al. 
2012). This suggests, based solely on the genomic difference in 
position, that C cluster regions recruit other cobinding factors than 
the A (or B) regions. Specifically, E2Fs and MYC have been shown 
to bind at positions similar to the A regions, supporting the rele- 
vancy of enrichment for E2F and MYC cognate sequences in the A 
cluster (e.g., Filers and Eisenman 2008). 

Next, we took advantage of published mouse ChlP-seq data 
sets (Chen et al. 2008; Hoffman et al. 2010; Laudadio et al. 2012) to 
test if a panel of factors bound to A versus C regions as predicted by 
our computational analysis. We find our analysis to be supported 
as the E2F1, KLF4, and MYC factor bound regions overlap signifi- 
cantly more with A regions, while FOXA2 and ONECUTl prefer- 
entially bind C regions (Fig. 4E; Supplemental Methods). 

Finally, we tested co-occupancy of CEBPs and several putative 
cof actors by performing sequential ChIP (Methods). This shows that 
the factor E2F3 binds to the two A regions Slbp and Cbx5 simul- 
taneously with CEBP factors, but none of the examined C regions, 
in accordance with our predictions. In contrast, the C cluster- 
associated factors ONECUTl, HNFIA, and MAFB interact with sev- 
eral C regions also occupied by CEBPs (Fig. 4F). Noticeably, a number 
of target regions are shared among the three C region factors. 

Two modes of transcriptional regulation by EGRl 

EGRl has been shown to be essential for a timely regenerative re- 
sponse in the mouse liver (Liao et al. 2004) and represents a classical 
"immediate early TF," which is up-regulated upon a growth stimulus 
(for review, see Thiel and Cibelli 2002). Recently, it was found to be 
part of a growth-signal discriminatory circuit together with TP53 
(Zwang et al. 201 1). Canonical EGRl cognate sequences are among 
the most highly enriched in the "cell cycle" or "proliferation" set of 
CEBP bound regulatory regions (the A cluster), together with E2F 
and MYC motifs (Supplemental Table S6). 
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To investigate the role of EGRl in liver regeneration in vivo, 
we performed ChlP-seq experiments with EGRl -specific antibodies 
at the 24- and 36-h time points. Unexpectedly, we observed a 
preference of EGRl for the C set over the A set regions (Fig. 5 A). 
Accordingly, A and EGRl peak summits were generally positioned 



further apart than C and EGRl peaks (Fig. 5B). Moreover, a GO 
analysis of putative target genes suggested that EGRl may be in- 
volved in all aspects of liver regeneration, as acute phase genes, 
metabolic genes, and cell cycle genes are found proximal to EGRl 
bound regions (Supplemental Table S9; data not shown). 



598 Genome Research 



www.genome.org 



Regulation of liver regeneration 





Figure 5. Two modes of EGR1 cobinding at A and C cluster regions. (A) Bar diagram displaying overlap between peaks defined by EGR1 binding (24 h or 
36 h, yellow bars) and CEBP binding. A or C cluster CEBP peak overlaps are shown (white numbers indicate proportions) with P-values of hypergeometric 
tests (Fisher's one-tailed) of similarity. (B) Diagram showing the distribution of distances between CEBP peak summits and the closest EGR1 (36-h) peak. A 
(gray) and C (blue) cluster peaks and a randomly distributed set (dashed line) are included. (C) CEBP and EGR1 ChIP read coverage and PWM hit tracks 
from representative A (fop panels) and C (bottom panels) cluster regions. Vertical red bars indicate CEBP and EGRl PWM hits; height of bars identity score; 
and transparent black bars show position of peak summits. (D) EGRl -ChIP using livers of mice deficient for Cebpb and wild-type livers. (K-axis) Ratio of 
enrichment for Cebpb-null versus wild-type experiments. EGRl bound regions overlapping with CEPB A cluster (gray) and C cluster (blue) peaks are shown. 
N = 3-4. Error bars, SEM. (**) P < 0.01 , Mann-Whitney test, two-tailed. For gene names, see Supplemental Table SI 0. 



By manual inspection of tightly overlapping EGRl and C re- 
gions bound by CEBPs, we found many peaks positioned precisely 
at a CEBP sequence but lacking EGRl cognate sequence (Fig. 5C, 
bottom panels). Conversely, EGRl peaks that possessed EGRl 
cognate sequences did not overlap exactly with the CEBP peaks, 
and these peaks were consistently associated with A cluster regions 
(Fig. 5C, top panels). Sequences of target regions with positions of 
CEBP and EGRl PWM hits and logos of used PWMs can be found in 
Supplemental Figure S5. 

To test if EGRl depends on CEBPs for interacting with DNA at 
C regions (lacking EGRl sequences), we performed EGRl -specific 
Chip with liver tissue from wild- type and Cebpb knockout mice. 
Our results show that EGRl binding depends significantly more on 
CEBPB at C regions than at A regions, suggesting that EGRl could 
bind DNA in two modes, indirectly (via CEBPs) at C regions and 
directly at A regions (Fig. 5D). 

Differential sets of transcriptional regulators targeted 
by the three clusters of c/5-regulatory regions 

To further explore the transcriptional network of liver regeneration, 
we focused on the 120 most highly expressed CEBP target genes 
annotated with the "transcriptional regulator, DNA-dependent" 



GO term (Supplemental Table S8). These we represent as targets 
of either A, B, or C cluster regions or of any combination hereof 
(Fig. 6A). 

We find that the large majority of regulators (110 of 120) are 
targeted by either the A or B cluster regions, while the C cluster 
targets much fewer (38), suggesting a massive shift at the tran- 
scriptional network level upon transition from a quiescent to a re- 
generative state. Moreover, the injury response clusters (A/B) also 
impinge on many of the regulators targeted at the quiescent stage 
(29 of 38 bound by the C cluster), hinting at tight integration of the 
genetic programs at successive stages of the complex regenerative 
process. 

A number of genes are associated with a high number of CEBP 
bound regions belonging to two or three clusters (Fig. 6A), sug- 
gesting a complex c/s-regulatory structure. This may suggest that 
they are important components, or "hubs," of the network. Prom- 
inent examples are the genes for the bHLH DNA-binding dominant 
repressor, ID2, as well as the cell cycle regulators JUN and EGRl 
(Fig. 6A,B). Also Cebpa and Cebpb themselves are targets of multiple 
bound regions, as are Mafb, Hnf4a, and several Klf genes (Fig. 6A). 

Several genes of regulatory factors are targeted by regions of 
which they may be coregulators based on the binding motif analysis 
(Figs. 4B, 6A). This can be interpreted as network feed-forward 
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Figure 6. The A, B, and C cluster regions target distinct sets of transcriptional regulator encoding genes. (A) Regulatory network showing transcription 
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or auto-regulatory loops (Fig. 6C). Examples include genes 
encoding E2Fs and KLFs for the A and B cluster regions and FOX 
factors for the C regions. Hnf4a is a shared target of regions from 
all three clusters, and the HNF4A binding sequence is also 
enriched in all clusters (Fig. 6A; Supplemental Tables S6, S7). These 
regulatory loops may be involved in adding robustness to the 
network as suggested for other transcriptional hierarchies (Lee 
et al. 2002). 

We detect the cholesterol NrlhS (also known as Lxr) and bile 
acid Nrlh4 (also known as Fxr) metabolism regulatory genes to be A 
and C cluster targets, respectively (Fig. 6A; Supplemental Table S8; 
Kalaany and Mangelsdorf 2006). We also find the gene encoding 



RXRA, the obligate binding partner of both TFs, to be targeted by 
multiple CEBP bound elements. This suggests that these factors 
may be interconnected with the CEBPs in regulation of choles- 
terol and bile metabolism during liver regeneration. 

Finally we note that almost all core members of the circadian 
clock system (Zhang and Kay 2010) turn out to be putative CEBP 
targets (Fig. 6A; Supplemental Table S8), e.g., the core TF genes 
Clock and Bmall (also known as Arntl), as well as the downstream 
effectors encoded by Perl/2/3 and Cryl/2, and secondary compo- 
nent genes Rora, Rorc, Nrld2 (also known as Rev-erb beta), Dbp, and 
Nfil3 (also known as E4bp4). This points to a high level of cross-talk 
between circadian clock and regenerative response regulation. 



600 Genome Research 



www.genome.org 



Regulation of liver regeneration 



Discussion 

The multi-time point global quantification of TF occupancy en- 
abled us to study the temporal dynamics of regulation in vivo. We 
find that the two central transcriptional regulators CEBPA and 
CEBPB interact with a large group of c/s-regulatory elements in 
a highly dynamic manner through the progressive phases of liver 
regeneration. This group of genomic regions can be subdivided 
by CEBP binding dynamics into three clusters with distinctly 
enriched sets of sequence motifs, pointing to molecular mecha- 
nisms governing differential CEBP binding. Guided by this ob- 
servation, we validate the predicted binding preference of several 
key cobinding TPs with sequential ChIP and published ChlP-seq 
data. 

An important aspect of our study is the integration of ob- 
served TF binding dynamics and expression of putative target 
genes. We find that each of the three binding clusters is associated 
with distinct sets of regulated target genes, i.e., acute phase genes, 
metabolic/homeostatic genes, and cell cycle-related genes (Table 1). 
This demonstrates the possibility of linking dynamic TF occupancy 
patterns with the shifting phases of the regenerative program 
(Fig. 7A). Moreover, the concomitant resurge of CEBP binding to 
the C cluster regions and metabolic gene activity at 24 h provides 
evidence for a previously uncharacterized, early "homeostatic'' 
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Figure 7. Protein level ratios of CEBPA versus CEBPB may define a dy- 
namic transcriptional switch. (A) Schematic diagram showing the di- 
vergent binding patterns and biological roles of the A, B, and C cluster c/s- 
regulatory elements through the first cell cycle of liver regeneration. A 
previously uncharacterized early resurge of metabolic/homeostatic genes, 
associated with C cluster regions with binding peaking at 24 h, was ob- 
served. (B) Model of a transcriptional switch centered on the relative ratio 
of CEBPA and CEBPB determining the composition of the CEBP complex 
pool (homo- or heterodimers). Gene activating or repressive actions are 
indicated, as well as sets of transcription factors found to be associated 
with A or C putative enhancers. Metabolic genes are induced and acute 
phase genes repressed by binding in a CEBPA-high setting (C regions), 
while the opposite is true for the CEBPB-high setting (B regions). A-type 
regions are only targeted by CEBPs when the CEBPB form is abundant. 



phase of liver regeneration, taking place before the first round of 
replication. 

The observed shifting phases of CEBP binding is closely 
temporally coupled to the dynamic ratio of CEBPA to CEBPB (Fig. 
2C,D). One possible interpretation is that this ratio is involved in 
dictating enhancer occupancy by CEBP complexes. As such, a high 
ratio of CEBPA to CEBPB would lead to binding of C cluster cis- 
regulatory elements enhancing metabolic and repressing acute 
phase response genes. Conversely, a low ratio would direct binding 
of B cluster elements that repress metabolic or activate acute phase 
genes or of A cluster elements that activate cell cycle genes. This 
differential recruitment of CEBPs could be explained by in- 
teraction with distinct sets of TFs, depending on CEBP complex 
composition. This model is summarized in Figure 7B. Our obser- 
vations and model contradict the conventional view that CEBPA 
levels are low while CEBPB levels are high throughout the re- 
generative process (Greenbaum et al. 1995, 1998), and emphasizes 
the requirement for a high degree of temporal resolution in studies 
of dynamic biological processes. 

Many cognate sequences associated with TFs not previously 
known to be involved in liver regeneration were found to be 
overrepresented in the CEBP bound regions (Supplemental Table 
S6). In the C cluster regions, motifs of several SOX factors were 
abundant (Fig. 4B). Recent findings implicate SOX factors in he- 
patocyte differentiation and stem cell biology (Duan et al. 2010; 
Furuyama et al. 2011). Our data find Soxl8, -13, and -15 to be 
highly expressed (Fig. 4C; Supplemental Table S7), but exactly 
which SOX factors cooperate with CEBPs in the liver remains to be 
determined. In the A cluster, we found overrepresented KLF factor 
motifs, and several highly expressed (e.g., KlflO, -13, and -15) 
are targets of CEBP A cluster enhancers (Figs. 4B,C, 6 A; Supple- 
mental Table S8). Liver functions were recently identified for 
KLF15 and KLFIO (Guillaumond et al. 2010; Takashima et al. 
2010). Moreover, KLF factors are central for self-renewal programs 
in embryonic stem cells (Takahashi and Yamanaka 2006; Jiang 
et al. 2008), which suggests that they may play similar roles in the 
self-renewal circuitry of fully differentiated hepatocytes, very likely 
as both binding partners and targets of CEBPs. 

An unexpected observation of this study was that EGRl 
binding preferentially colocalizes with the C cluster regions (Fig. 
5A), which are almost depleted of EGRl cognate sequence (Fig. 4B; 
Supplemental Table S6). Our data support a model of indirect or 
"assisted" EGRl binding to explain this observation (Fig. 5C,D). 
Only one example of indirect EGRl binding via CEBPB has been 
published (Zhang et al. 2003), but our genome-wide observations 
indicate that EGRl targeting of metabolic gene promoters (C 
cluster regions) via CEBPs could be a general phenomenon in the 
liver. EGRl has also been found to be important for interpretation 
of mitogenic signals (Zwang et al. 2011) and targets A cluster re- 
gions in this study (Fig. 5A,D; Supplemental Table S9). Hence, the 
observed CEBPA-to-CEBPB shift in the CEBP pool could regulate 
EGRl recruitment to A or C cluster c/s-regulatory elements and 
thus ensure a tight temporal control of EGRl action in line with 
the progressive phases of liver regeneration. 

In essence, liver regeneration upon partial hepatectomy is 
compensatory growth, as the cells proliferating are fully differen- 
tiated hepatocytes. Previous studies have shown that liver re- 
generation does not involve loss of hepatocyte differentiation state 
(e.g., Malato et al. 2011; for reviews, see Fausto et al. 2006; 
Michalopoulos 2007). This is consistent with our finding that 
many hepatocyte-specific genes, e.g., involved in metabolic func- 
tions, are up-regulated rather than down-regulated. (Fig. 3B; Table 
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1). Only few examples are known of differentiated cells capable of 
hepatocyte-like "self-renewal" in the adult mammalian organism. 
It has recently been shown that the quiescence of naive T cells is 
actively enforced by a balance between FOXPl and FOXOl, 
allowing swift expansion upon external cues (Feng et al. 2011). 
Similarly, deficiency of just two TFs lends long-term, non- 
tumorigenic expansion capacity to mature macrophages (Aziz 
et al. 2009). These two factors are MAF (c-maf) and MAFB, both of 
which we find to be putative CEBP targets in the liver (Fig. 6A; 
Supplemental Table S8). MAFB is among the most highly expressed 
TFs in the quiescent condition and is strongly down-regulated 
from the 8-h time point (Supplemental Tables S4, S8). Further- 
more, we find MAFB and MAFK binding sequences enriched in the 
quiescence/metabolism-associated C cluster of enhancers, pre- 
dominantly bound by CEBPs at the 0- and 24-h time points. In the 
proliferation-associated A cluster, we find KLF and MYC sequences 
(Fig. 4B; Supplemental Table S6); KLF4 and MYC were demon- 
strated to be required for expansion of the differentiated macro- 
phages (Aziz et al. 2009). Moreover, CEBPA is a key determinant of 
macrophage as well as liver cell differentiation (e.g., Feng et al. 
2008). These observations may suggest that similar transcriptional 
networks centered on a CEBP/MAF axis are enforcing quiescence 
in both fully differentiated hepatocytes and macrophages. Un- 
derstanding this circuitry has the potential to be of use in re- 
generative medicine as a step toward manipulation of hepatocytes 
or other fully differentiated cells for therapeutic purposes. 

In conclusion, the present work shows how time-resolved 
analysis of two core TFs of liver regeneration can reveal specific 
aspects of the regulation operating at distinct phases during the 
process. In a broader sense, our approach of analyzing dynamic TF 
occupancy globally in vivo should be applicable to other experi- 
mental systems and holds promise to aid in the elucidation of 
complex transcriptional networks in higher vertebrates. 

Methods 

Mouse work 

The partial hepatectomy was performed on wild-type mice 
(C57BL6/J, male, 7 wk old) by removing three of five liver lobes 
according to the method described earlier (Thoren et al. 2010). 
Mice were culled after varying amounts of time covering the 
regrowth phase of liver regeneration (0, 3, 8, 16, 24, 36, 48, and 168 
h) (Fig. 1A,B). Livers from five individual mice for each time point 
were harvested, generating 40 samples; directly snap-frozen in 
liquid nitrogen; and stored at -80°C. Animal experiments con- 
formed to institutional as well as Danish national guidelines. 

Chromatin immunoprecipitation 

After thawing, tissue samples were homogenized by douncing 
(loose pestle, Wheaton 15-mL douncer) in cold PBS, and were 
cross-linked for 10 min in 1% formaldehyde using a rotator. 
Chromatin was fragmented by sonication (Bioruptor, Diagenode). 
The 40 samples were used in ChlP-seq experiments with antibodies 
against CEBPA, CEBPB, EGRl (Santa Cruz: sc-61, sc-150, sc-llOx), 
RNA-POL2 subunit Bl antibody (AC-055-100, Diagenode), and 
Mock IgG (Sigma 18140). ChIP was performed according to the 
method described earlier (Sandmann et al. 2006a). Sequential ChIP 
was done according to the method described previously (Truax and 
Greer 2012), with the modification of cross-linking antibody- 
Protein A beads (www.neb.com). Santa Cruz antibodies as above 
were used for sequential CEBPA-CEBPB ChIP (Supplemental Fig. 
S6). An in-house CEBP antibody (JSJ#1052) recognizing both 



CEBPA and CEBPB was used in the first round in sequential ChIP 
with antibodies against ONECUTl (HNF6), HNFIA, E2F3, MAFB, 
or IgG for the second round (Santa Cruz: sc-13050x, sc-8986x, sc- 
878x; Novus Bio: nb600-266). All sequential ChIP enrichments 
were normalized to IgG ratios. Enrichment was validated by qPCR 
(ABI Prism 7000 or Roche Lightcycler 480), using ratios of a posi- 
tive detector primer set versus a negative (Mamstr, Chr_12_desertl 
or Sfi2). All primer sets are listed in Supplemental Table SIO. 

Clustering of CEBPA and CEBPB peaks 

Consensus regions for CEBPA and CEBPB peaks for all time points 
were defined by merging overlapping regions between sets, re- 
quiring a called peak in at least one sample, producing a total of 
87,049 consensus regions. Subsequently, coverage, defined as the 
maximal coverage level for each region, was determined using the 
normalized counts (Supplemental Material). Filtering was applied, 
requiring normalized coverage of a minimum of 50 reads in at least 
one sample for any given region, reducing the set to 11,314. Hi- 
erarchical clustering was performed in MeV (http://www.tm4.org/ 
mev/) (Saeed et al. 2006), and three predominant clusters were 
identified: A (3449 regions), B (2818), and C (3034). The remaining 
2013 regions were excluded from further analysis. Coverage reads 
were summed for each time point/cluster, normalized, and used for 
displaying sum coverage tracks (Fig. 2B). Consensus region cover- 
age data can be found in Supplemental Table S3. 

Data access 

The timeseries ChlP-seq data generated for this work have been 
deposited in the NCBl Gene Expression Omnibus (GEO) (http:// 
www.ncbi.nlm.nih.gov/geo/) and are accessible through GEO Series 
accession number GSE42321. 
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